Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 3678 |
| Missing cells | 6713 |
| Missing cells (%) | 7.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 689.6 KiB |
| Average record size in memory | 192.0 B |
Variable types
| Categorical | 10 |
|---|---|
| Text | 3 |
| Numeric | 10 |
store room is highly imbalanced (55.7%) | Imbalance |
facing has 1046 (28.4%) missing values | Missing |
super_built_up_area has 1803 (49.0%) missing values | Missing |
built_up_area has 1988 (54.1%) missing values | Missing |
carpet_area has 1805 (49.1%) missing values | Missing |
area is highly skewed (γ1 = 29.73497204) | Skewed |
built_up_area is highly skewed (γ1 = 40.70657243) | Skewed |
carpet_area is highly skewed (γ1 = 24.33971847) | Skewed |
floorNum has 129 (3.5%) zeros | Zeros |
luxury_score has 462 (12.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-23 15:06:04.319707 |
|---|---|
| Analysis finished | 2024-05-23 15:06:12.351265 |
| Duration | 8.03 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
property_type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| flat | |
|---|---|
| house |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.2335508 |
| Min length | 4 |
Characters and Unicode
| Total characters | 15571 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | house |
|---|---|
| 2nd row | flat |
| 3rd row | flat |
| 4th row | house |
| 5th row | flat |
Common Values
| Value | Count | Frequency (%) |
| flat | 2819 | |
| house | 859 | 23.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| flat | 2819 | |
| house | 859 | 23.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15571 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15571 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15571 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2819 | |
| l | 2819 | |
| a | 2819 | |
| t | 2819 | |
| h | 859 | 5.5% |
| o | 859 | 5.5% |
| u | 859 | 5.5% |
| s | 859 | 5.5% |
| e | 859 | 5.5% |
society
Text
| Distinct | 676 |
|---|---|
| Distinct (%) | 18.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 57.5 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 39 |
| Mean length | 16.873538 |
| Min length | 1 |
Characters and Unicode
| Total characters | 62044 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 308 ? |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | dlf city plots |
|---|---|
| 2nd row | imperia the esfera |
| 3rd row | alpha corp gurgaonone |
| 4th row | independent |
| 5th row | vatika sovereign next sector-82 a gurgaon |
| Value | Count | Frequency (%) |
| independent | 491 | 5.1% |
| the | 350 | 3.6% |
| dlf | 220 | 2.3% |
| park | 209 | 2.2% |
| city | 166 | 1.7% |
| emaar | 155 | 1.6% |
| global | 154 | 1.6% |
| m3m | 152 | 1.6% |
| signature | 151 | 1.6% |
| heights | 134 | 1.4% |
| Other values (783) | 7499 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6713 | 10.8% |
| 6006 | 9.7% | |
| a | 5865 | 9.5% |
| r | 4174 | 6.7% |
| n | 4165 | 6.7% |
| i | 3831 | 6.2% |
| t | 3720 | 6.0% |
| s | 3473 | 5.6% |
| l | 2945 | 4.7% |
| o | 2757 | 4.4% |
| Other values (31) | 18395 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 55493 | |
| Space Separator | 6006 | 9.7% |
| Decimal Number | 527 | 0.8% |
| Other Punctuation | 10 | < 0.1% |
| Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6713 | |
| a | 5865 | 10.6% |
| r | 4174 | 7.5% |
| n | 4165 | 7.5% |
| i | 3831 | 6.9% |
| t | 3720 | 6.7% |
| s | 3473 | 6.3% |
| l | 2945 | 5.3% |
| o | 2757 | 5.0% |
| d | 2489 | 4.5% |
| Other values (16) | 15361 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 207 | |
| 2 | 82 | 15.6% |
| 1 | 75 | 14.2% |
| 6 | 56 | 10.6% |
| 8 | 32 | 6.1% |
| 4 | 19 | 3.6% |
| 5 | 17 | 3.2% |
| 0 | 13 | 2.5% |
| 9 | 13 | 2.5% |
| 7 | 13 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | |
| / | 2 | 20.0% |
| . | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 6006 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55493 | |
| Common | 6551 | 10.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6713 | |
| a | 5865 | 10.6% |
| r | 4174 | 7.5% |
| n | 4165 | 7.5% |
| i | 3831 | 6.9% |
| t | 3720 | 6.7% |
| s | 3473 | 6.3% |
| l | 2945 | 5.3% |
| o | 2757 | 5.0% |
| d | 2489 | 4.5% |
| Other values (16) | 15361 |
Common
| Value | Count | Frequency (%) |
| 6006 | ||
| 3 | 207 | 3.2% |
| 2 | 82 | 1.3% |
| 1 | 75 | 1.1% |
| 6 | 56 | 0.9% |
| 8 | 32 | 0.5% |
| 4 | 19 | 0.3% |
| 5 | 17 | 0.3% |
| 0 | 13 | 0.2% |
| 9 | 13 | 0.2% |
| Other values (5) | 31 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62044 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6713 | 10.8% |
| 6006 | 9.7% | |
| a | 5865 | 9.5% |
| r | 4174 | 6.7% |
| n | 4165 | 6.7% |
| i | 3831 | 6.2% |
| t | 3720 | 6.0% |
| s | 3473 | 5.6% |
| l | 2945 | 4.7% |
| o | 2757 | 4.4% |
| Other values (31) | 18395 |
sector
Text
| Distinct | 114 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 9 |
| Mean length | 9.3140294 |
| Min length | 3 |
Characters and Unicode
| Total characters | 34257 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | sector 26 |
|---|---|
| 2nd row | sector 37c |
| 3rd row | sector 84 |
| 4th row | sector 110 |
| 5th row | sector 82a |
| Value | Count | Frequency (%) |
| sector | 3449 | |
| road | 178 | 2.4% |
| sohna | 166 | 2.2% |
| 85 | 108 | 1.5% |
| 102 | 107 | 1.4% |
| 92 | 100 | 1.4% |
| 69 | 93 | 1.3% |
| 90 | 89 | 1.2% |
| 65 | 87 | 1.2% |
| 81 | 87 | 1.2% |
| Other values (107) | 2916 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3804 | |
| 3702 | ||
| s | 3694 | |
| r | 3694 | |
| e | 3543 | |
| c | 3500 | |
| t | 3460 | |
| 1 | 1076 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 780 | 2.3% |
| Other values (21) | 6202 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23292 | |
| Decimal Number | 7263 | 21.2% |
| Space Separator | 3702 | 10.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3804 | |
| s | 3694 | |
| r | 3694 | |
| e | 3543 | |
| c | 3500 | |
| t | 3460 | |
| a | 698 | 3.0% |
| d | 249 | 1.1% |
| n | 225 | 1.0% |
| h | 203 | 0.9% |
| Other values (10) | 222 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1076 | |
| 0 | 802 | |
| 8 | 780 | |
| 9 | 764 | |
| 6 | 740 | |
| 7 | 682 | |
| 2 | 676 | |
| 3 | 666 | |
| 5 | 593 | |
| 4 | 484 |
Space Separator
| Value | Count | Frequency (%) |
| 3702 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23292 | |
| Common | 10965 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3804 | |
| s | 3694 | |
| r | 3694 | |
| e | 3543 | |
| c | 3500 | |
| t | 3460 | |
| a | 698 | 3.0% |
| d | 249 | 1.1% |
| n | 225 | 1.0% |
| h | 203 | 0.9% |
| Other values (10) | 222 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 3702 | ||
| 1 | 1076 | 9.8% |
| 0 | 802 | 7.3% |
| 8 | 780 | 7.1% |
| 9 | 764 | 7.0% |
| 6 | 740 | 6.7% |
| 7 | 682 | 6.2% |
| 2 | 676 | 6.2% |
| 3 | 666 | 6.1% |
| 5 | 593 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34257 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3804 | |
| 3702 | ||
| s | 3694 | |
| r | 3694 | |
| e | 3543 | |
| c | 3500 | |
| t | 3460 | |
| 1 | 1076 | 3.1% |
| 0 | 802 | 2.3% |
| 8 | 780 | 2.3% |
| Other values (21) | 6202 |
price
Real number (ℝ)
| Distinct | 473 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5330811 |
| Minimum | 0.07 |
|---|---|
| Maximum | 31.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0.07 |
|---|---|
| 5-th percentile | 0.37 |
| Q1 | 0.95 |
| median | 1.52 |
| Q3 | 2.75 |
| 95-th percentile | 8.5 |
| Maximum | 31.5 |
| Range | 31.43 |
| Interquartile range (IQR) | 1.8 |
Descriptive statistics
| Standard deviation | 2.9804249 |
|---|---|
| Coefficient of variation (CV) | 1.1766007 |
| Kurtosis | 14.935884 |
| Mean | 2.5330811 |
| Median Absolute Deviation (MAD) | 0.72 |
| Skewness | 3.2794161 |
| Sum | 9273.61 |
| Variance | 8.8829326 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.25 | 80 | 2.2% |
| 1.5 | 64 | 1.7% |
| 1.2 | 64 | 1.7% |
| 0.9 | 63 | 1.7% |
| 1.1 | 62 | 1.7% |
| 1.4 | 60 | 1.6% |
| 1.3 | 57 | 1.5% |
| 0.95 | 52 | 1.4% |
| 2 | 52 | 1.4% |
| 1.6 | 48 | 1.3% |
| Other values (463) | 3059 |
| Value | Count | Frequency (%) |
| 0.07 | 1 | < 0.1% |
| 0.16 | 1 | < 0.1% |
| 0.17 | 1 | < 0.1% |
| 0.19 | 1 | < 0.1% |
| 0.2 | 8 | |
| 0.21 | 6 | |
| 0.22 | 8 | |
| 0.23 | 1 | < 0.1% |
| 0.24 | 6 | |
| 0.25 | 11 |
| Value | Count | Frequency (%) |
| 31.5 | 1 | < 0.1% |
| 27.5 | 1 | < 0.1% |
| 26 | 2 | |
| 25 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 20 | 3 | |
| 19.5 | 2 | |
| 19 | 3 |
price_per_sqft
Real number (ℝ)
| Distinct | 2651 |
|---|---|
| Distinct (%) | 72.4% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13890.884 |
| Minimum | 4 |
|---|---|
| Maximum | 600000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4716 |
| Q1 | 6818 |
| median | 9020 |
| Q3 | 13878 |
| 95-th percentile | 33333 |
| Maximum | 600000 |
| Range | 599996 |
| Interquartile range (IQR) | 7060 |
Descriptive statistics
| Standard deviation | 23207.147 |
|---|---|
| Coefficient of variation (CV) | 1.6706747 |
| Kurtosis | 186.97513 |
| Mean | 13890.884 |
| Median Absolute Deviation (MAD) | 2793 |
| Skewness | 11.438605 |
| Sum | 50854525 |
| Variance | 5.3857169 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000 | 27 | 0.7% |
| 8000 | 19 | 0.5% |
| 5000 | 17 | 0.5% |
| 12500 | 14 | 0.4% |
| 11111 | 13 | 0.4% |
| 6666 | 13 | 0.4% |
| 22222 | 13 | 0.4% |
| 7500 | 12 | 0.3% |
| 8333 | 12 | 0.3% |
| 6000 | 11 | 0.3% |
| Other values (2641) | 3510 | |
| (Missing) | 17 | 0.5% |
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 5 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 53 | 1 | |
| 57 | 1 | |
| 58 | 2 | |
| 60 | 1 | |
| 61 | 1 | |
| 79 | 1 |
| Value | Count | Frequency (%) |
| 600000 | 1 | |
| 400000 | 1 | |
| 315789 | 1 | |
| 308333 | 1 | |
| 290948 | 1 | |
| 283333 | 1 | |
| 266666 | 1 | |
| 261194 | 1 | |
| 245398 | 1 | |
| 241666 | 1 |
area
Real number (ℝ)
SKEWED 
| Distinct | 1312 |
|---|---|
| Distinct (%) | 35.8% |
| Missing | 17 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2887.6908 |
| Minimum | 50 |
|---|---|
| Maximum | 875000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 519 |
| Q1 | 1230 |
| median | 1733 |
| Q3 | 2300 |
| 95-th percentile | 4246 |
| Maximum | 875000 |
| Range | 874950 |
| Interquartile range (IQR) | 1070 |
Descriptive statistics
| Standard deviation | 23164.373 |
|---|---|
| Coefficient of variation (CV) | 8.0217637 |
| Kurtosis | 942.28488 |
| Mean | 2887.6908 |
| Median Absolute Deviation (MAD) | 533 |
| Skewness | 29.734972 |
| Sum | 10571836 |
| Variance | 5.3658819 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1650 | 54 | 1.5% |
| 1350 | 48 | 1.3% |
| 1800 | 47 | 1.3% |
| 3240 | 43 | 1.2% |
| 1950 | 43 | 1.2% |
| 2700 | 39 | 1.1% |
| 900 | 38 | 1.0% |
| 2000 | 33 | 0.9% |
| 2250 | 25 | 0.7% |
| 2400 | 23 | 0.6% |
| Other values (1302) | 3268 |
| Value | Count | Frequency (%) |
| 50 | 4 | |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 2 | |
| 61 | 1 | < 0.1% |
| 67 | 2 | |
| 70 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 875000 | 1 | |
| 642857 | 1 | |
| 620000 | 1 | |
| 566667 | 1 | |
| 215517 | 1 | |
| 98978 | 1 | |
| 82781 | 1 | |
| 65517 | 2 | |
| 65261 | 1 | |
| 58228 | 1 |
areaWithType
Text
| Distinct | 2355 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
Length
| Max length | 124 |
|---|---|
| Median length | 119 |
| Mean length | 54.230016 |
| Min length | 12 |
Characters and Unicode
| Total characters | 199458 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1849 ? |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Plot area 360(301.01 sq.m.) |
|---|---|
| 2nd row | Super Built up area 1815(168.62 sq.m.)Built Up area: 1400 sq.ft. (130.06 sq.m.)Carpet area: 1270 sq.ft. (117.99 sq.m.) |
| 3rd row | Super Built up area 1963(182.37 sq.m.)Carpet area: 1700 sq.ft. (157.94 sq.m.) |
| 4th row | Built Up area: 500 (46.45 sq.m.) |
| 5th row | Super Built up area 4350(404.13 sq.m.)Carpet area: 3480 sq.ft. (323.3 sq.m.) |
| Value | Count | Frequency (%) |
| area | 5574 | |
| sq.m | 3656 | |
| up | 3020 | 10.0% |
| built | 2316 | 7.7% |
| super | 1875 | 6.2% |
| sq.ft | 1751 | 5.8% |
| sq.m.)carpet | 1185 | 3.9% |
| sq.m.)built | 702 | 2.3% |
| carpet | 684 | 2.3% |
| plot | 681 | 2.3% |
| Other values (2846) | 8702 |
Most occurring characters
| Value | Count | Frequency (%) |
| 26468 | 13.3% | |
| . | 20393 | 10.2% |
| a | 13157 | 6.6% |
| r | 9458 | 4.7% |
| e | 9322 | 4.7% |
| 1 | 9205 | 4.6% |
| s | 7568 | 3.8% |
| q | 7432 | 3.7% |
| t | 7325 | 3.7% |
| u | 6770 | 3.4% |
| Other values (25) | 82360 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82770 | |
| Decimal Number | 47143 | |
| Space Separator | 26468 | 13.3% |
| Other Punctuation | 23411 | 11.7% |
| Uppercase Letter | 8594 | 4.3% |
| Close Punctuation | 5536 | 2.8% |
| Open Punctuation | 5536 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13157 | |
| r | 9458 | |
| e | 9322 | |
| s | 7568 | |
| q | 7432 | |
| t | 7325 | |
| u | 6770 | |
| p | 6768 | |
| m | 5545 | |
| l | 3701 | 4.5% |
| Other values (5) | 5724 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9205 | |
| 0 | 6629 | |
| 2 | 5688 | |
| 5 | 4718 | |
| 3 | 3962 | |
| 4 | 3712 | |
| 6 | 3674 | 7.8% |
| 7 | 3254 | 6.9% |
| 8 | 3157 | 6.7% |
| 9 | 3144 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 3020 | |
| S | 1875 | |
| C | 1873 | |
| U | 1145 | 13.3% |
| P | 681 | 7.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20393 | |
| : | 3018 | 12.9% |
Space Separator
| Value | Count | Frequency (%) |
| 26468 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5536 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 108094 | |
| Latin | 91364 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13157 | |
| r | 9458 | |
| e | 9322 | |
| s | 7568 | |
| q | 7432 | |
| t | 7325 | |
| u | 6770 | |
| p | 6768 | |
| m | 5545 | 6.1% |
| l | 3701 | 4.1% |
| Other values (10) | 14318 |
Common
| Value | Count | Frequency (%) |
| 26468 | ||
| . | 20393 | |
| 1 | 9205 | 8.5% |
| 0 | 6629 | 6.1% |
| 2 | 5688 | 5.3% |
| ) | 5536 | 5.1% |
| ( | 5536 | 5.1% |
| 5 | 4718 | 4.4% |
| 3 | 3962 | 3.7% |
| 4 | 3712 | 3.4% |
| Other values (5) | 16247 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 26468 | 13.3% | |
| . | 20393 | 10.2% |
| a | 13157 | 6.6% |
| r | 9458 | 4.7% |
| e | 9322 | 4.7% |
| 1 | 9205 | 4.6% |
| s | 7568 | 3.8% |
| q | 7432 | 3.7% |
| t | 7325 | 3.7% |
| u | 6770 | 3.4% |
| Other values (25) | 82360 |
bedRoom
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3597064 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.8975034 |
|---|---|
| Coefficient of variation (CV) | 0.5647825 |
| Kurtosis | 18.215499 |
| Mean | 3.3597064 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.4853698 |
| Sum | 12357 |
| Variance | 3.600519 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1496 | |
| 2 | 943 | |
| 4 | 660 | |
| 5 | 210 | 5.7% |
| 1 | 124 | 3.4% |
| 6 | 74 | 2.0% |
| 9 | 41 | 1.1% |
| 8 | 30 | 0.8% |
| 7 | 28 | 0.8% |
| 12 | 28 | 0.8% |
| Other values (9) | 44 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 124 | 3.4% |
| 2 | 943 | |
| 3 | 1496 | |
| 4 | 660 | |
| 5 | 210 | 5.7% |
| 6 | 74 | 2.0% |
| 7 | 28 | 0.8% |
| 8 | 30 | 0.8% |
| 9 | 41 | 1.1% |
| 10 | 20 | 0.5% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 2 | 0.1% |
| 18 | 2 | 0.1% |
| 16 | 12 | |
| 14 | 1 | < 0.1% |
| 13 | 4 | 0.1% |
| 12 | 28 | |
| 11 | 1 | < 0.1% |
| 10 | 20 |
bathroom
Real number (ℝ)
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4241436 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9479448 |
|---|---|
| Coefficient of variation (CV) | 0.56888526 |
| Kurtosis | 17.544566 |
| Mean | 3.4241436 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.2490529 |
| Sum | 12594 |
| Variance | 3.794489 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1077 | |
| 2 | 1048 | |
| 4 | 820 | |
| 5 | 294 | 8.0% |
| 1 | 156 | 4.2% |
| 6 | 117 | 3.2% |
| 9 | 41 | 1.1% |
| 7 | 40 | 1.1% |
| 8 | 25 | 0.7% |
| 12 | 22 | 0.6% |
| Other values (9) | 38 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 156 | 4.2% |
| 2 | 1048 | |
| 3 | 1077 | |
| 4 | 820 | |
| 5 | 294 | 8.0% |
| 6 | 117 | 3.2% |
| 7 | 40 | 1.1% |
| 8 | 25 | 0.7% |
| 9 | 41 | 1.1% |
| 10 | 9 | 0.2% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 3 | 0.1% |
| 18 | 4 | 0.1% |
| 17 | 3 | 0.1% |
| 16 | 8 | 0.2% |
| 14 | 2 | 0.1% |
| 13 | 4 | 0.1% |
| 12 | 22 | |
| 11 | 4 | 0.1% |
| 10 | 9 |
balcony
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 3+ | |
|---|---|
| 3 | |
| 2 | |
| 1 | |
| 0 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.3186514 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4850 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3+ |
| 3rd row | 3 |
| 4th row | 0 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3+ | 1172 | |
| 3 | 1074 | |
| 2 | 885 | |
| 1 | 365 | 9.9% |
| 0 | 182 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| 2 | 885 | 24.1% |
| 1 | 365 | 9.9% |
| 0 | 182 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| + | 1172 | |
| 2 | 885 | 18.2% |
| 1 | 365 | 7.5% |
| 0 | 182 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 | |
| Math Symbol | 1172 | 24.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| 2 | 885 | 24.1% |
| 1 | 365 | 9.9% |
| 0 | 182 | 4.9% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1172 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4850 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| + | 1172 | |
| 2 | 885 | 18.2% |
| 1 | 365 | 7.5% |
| 0 | 182 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2246 | |
| + | 1172 | |
| 2 | 885 | 18.2% |
| 1 | 365 | 7.5% |
| 0 | 182 | 3.8% |
floorNum
Real number (ℝ)
ZEROS 
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 19 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.7993987 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 129 |
| Zeros (%) | 3.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 10 |
| 95-th percentile | 18 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 6.0120336 |
|---|---|
| Coefficient of variation (CV) | 0.88420078 |
| Kurtosis | 4.5142084 |
| Mean | 6.7993987 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.693111 |
| Sum | 24879 |
| Variance | 36.144549 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 498 | |
| 2 | 493 | |
| 1 | 351 | 9.5% |
| 4 | 316 | 8.6% |
| 8 | 195 | 5.3% |
| 6 | 183 | 5.0% |
| 10 | 179 | 4.9% |
| 7 | 176 | 4.8% |
| 5 | 169 | 4.6% |
| 9 | 161 | 4.4% |
| Other values (33) | 938 |
| Value | Count | Frequency (%) |
| 0 | 129 | 3.5% |
| 1 | 351 | |
| 2 | 493 | |
| 3 | 498 | |
| 4 | 316 | |
| 5 | 169 | 4.6% |
| 6 | 183 | 5.0% |
| 7 | 176 | 4.8% |
| 8 | 195 | 5.3% |
| 9 | 161 | 4.4% |
| Value | Count | Frequency (%) |
| 51 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 43 | 2 | |
| 40 | 1 | < 0.1% |
| 39 | 2 | |
| 38 | 1 | < 0.1% |
| 35 | 2 | |
| 34 | 2 | |
| 33 | 4 |
facing
Categorical
MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1046 |
| Missing (%) | 28.4% |
| Memory size | 57.5 KiB |
| North-East | |
|---|---|
| East | |
| North | |
| West | |
| South | |
| Other values (3) |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 6.8381459 |
| Min length | 4 |
Characters and Unicode
| Total characters | 17998 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | North-East |
|---|---|
| 2nd row | North-West |
| 3rd row | South-East |
| 4th row | East |
| 5th row | North-East |
Common Values
| Value | Count | Frequency (%) |
| North-East | 623 | |
| East | 623 | |
| North | 387 | 10.5% |
| West | 249 | 6.8% |
| South | 231 | 6.3% |
| North-West | 193 | 5.2% |
| South-East | 173 | 4.7% |
| South-West | 153 | 4.2% |
| (Missing) | 1046 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| north-east | 623 | |
| east | 623 | |
| north | 387 | |
| west | 249 | 9.5% |
| south | 231 | 8.8% |
| north-west | 193 | 7.3% |
| south-east | 173 | 6.6% |
| south-west | 153 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| E | 1419 | 7.9% |
| a | 1419 | 7.9% |
| N | 1203 | 6.7% |
| r | 1203 | 6.7% |
| - | 1142 | 6.3% |
| W | 595 | 3.3% |
| Other values (3) | 1709 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13082 | |
| Uppercase Letter | 3774 | 21.0% |
| Dash Punctuation | 1142 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| a | 1419 | 10.8% |
| r | 1203 | 9.2% |
| e | 595 | 4.5% |
| u | 557 | 4.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1419 | |
| N | 1203 | |
| W | 595 | |
| S | 557 | 14.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16856 | |
| Common | 1142 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| E | 1419 | 8.4% |
| a | 1419 | 8.4% |
| N | 1203 | 7.1% |
| r | 1203 | 7.1% |
| W | 595 | 3.5% |
| e | 595 | 3.5% |
| Other values (2) | 1114 | 6.6% |
Common
| Value | Count | Frequency (%) |
| - | 1142 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17998 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3774 | |
| s | 2014 | |
| o | 1760 | |
| h | 1760 | |
| E | 1419 | 7.9% |
| a | 1419 | 7.9% |
| N | 1203 | 6.7% |
| r | 1203 | 6.7% |
| - | 1142 | 6.3% |
| W | 595 | 3.3% |
| Other values (3) | 1709 |
agePossession
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| Relatively New | |
|---|---|
| New Property | |
| Moderately Old | |
| Undefined | |
| Old Property |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 13.385536 |
| Min length | 9 |
Characters and Unicode
| Total characters | 49232 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moderately Old |
|---|---|
| 2nd row | Relatively New |
| 3rd row | Relatively New |
| 4th row | Undefined |
| 5th row | Under Construction |
Common Values
| Value | Count | Frequency (%) |
| Relatively New | 1646 | |
| New Property | 594 | 16.2% |
| Moderately Old | 563 | 15.3% |
| Undefined | 306 | 8.3% |
| Old Property | 303 | 8.2% |
| Under Construction | 266 | 7.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| new | 2240 | |
| relatively | 1646 | |
| property | 897 | |
| old | 866 | 12.3% |
| moderately | 563 | 8.0% |
| undefined | 306 | 4.3% |
| under | 266 | 3.8% |
| construction | 266 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8433 | |
| l | 4721 | 9.6% |
| t | 3638 | 7.4% |
| 3372 | 6.8% | |
| y | 3106 | 6.3% |
| r | 2889 | 5.9% |
| d | 2307 | 4.7% |
| N | 2240 | 4.5% |
| w | 2240 | 4.5% |
| i | 2218 | 4.5% |
| Other values (15) | 14068 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 38810 | |
| Uppercase Letter | 7050 | 14.3% |
| Space Separator | 3372 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8433 | |
| l | 4721 | |
| t | 3638 | |
| y | 3106 | 8.0% |
| r | 2889 | 7.4% |
| d | 2307 | 5.9% |
| w | 2240 | 5.8% |
| i | 2218 | 5.7% |
| a | 2209 | 5.7% |
| o | 1992 | 5.1% |
| Other values (7) | 5057 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2240 | |
| R | 1646 | |
| P | 897 | |
| O | 866 | 12.3% |
| U | 572 | 8.1% |
| M | 563 | 8.0% |
| C | 266 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 3372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45860 | |
| Common | 3372 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8433 | |
| l | 4721 | 10.3% |
| t | 3638 | 7.9% |
| y | 3106 | 6.8% |
| r | 2889 | 6.3% |
| d | 2307 | 5.0% |
| N | 2240 | 4.9% |
| w | 2240 | 4.9% |
| i | 2218 | 4.8% |
| a | 2209 | 4.8% |
| Other values (14) | 11859 |
Common
| Value | Count | Frequency (%) |
| 3372 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49232 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8433 | |
| l | 4721 | 9.6% |
| t | 3638 | 7.4% |
| 3372 | 6.8% | |
| y | 3106 | 6.3% |
| r | 2889 | 5.9% |
| d | 2307 | 4.7% |
| N | 2240 | 4.5% |
| w | 2240 | 4.5% |
| i | 2218 | 4.5% |
| Other values (15) | 14068 |
super_built_up_area
Real number (ℝ)
MISSING 
| Distinct | 593 |
|---|---|
| Distinct (%) | 31.6% |
| Missing | 1803 |
| Missing (%) | 49.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1925.2376 |
| Minimum | 89 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 89 |
|---|---|
| 5-th percentile | 767 |
| Q1 | 1479.5 |
| median | 1828 |
| Q3 | 2215 |
| 95-th percentile | 3185 |
| Maximum | 10000 |
| Range | 9911 |
| Interquartile range (IQR) | 735.5 |
Descriptive statistics
| Standard deviation | 764.17218 |
|---|---|
| Coefficient of variation (CV) | 0.39692356 |
| Kurtosis | 10.349191 |
| Mean | 1925.2376 |
| Median Absolute Deviation (MAD) | 372 |
| Skewness | 1.8364563 |
| Sum | 3609820.5 |
| Variance | 583959.12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1650 | 37 | 1.0% |
| 1950 | 37 | 1.0% |
| 2000 | 25 | 0.7% |
| 1578 | 25 | 0.7% |
| 1640 | 22 | 0.6% |
| 2150 | 22 | 0.6% |
| 1900 | 19 | 0.5% |
| 2408 | 19 | 0.5% |
| 1930 | 18 | 0.5% |
| 2812 | 17 | 0.5% |
| Other values (583) | 1634 | |
| (Missing) | 1803 |
| Value | Count | Frequency (%) |
| 89 | 1 | |
| 145 | 1 | |
| 161 | 1 | |
| 215 | 1 | |
| 216 | 1 | |
| 325 | 1 | |
| 340 | 1 | |
| 352 | 1 | |
| 380 | 1 | |
| 406 | 1 |
| Value | Count | Frequency (%) |
| 10000 | 1 | |
| 6926 | 1 | |
| 6000 | 1 | |
| 5800 | 2 | |
| 5514 | 1 | |
| 5350 | 2 | |
| 5200 | 2 | |
| 4890 | 1 | |
| 4857 | 1 | |
| 4848 | 2 |
built_up_area
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 644 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 1988 |
| Missing (%) | 54.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2379.5858 |
| Minimum | 2 |
|---|---|
| Maximum | 737147 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 240.45 |
| Q1 | 1100 |
| median | 1650 |
| Q3 | 2400 |
| 95-th percentile | 4691 |
| Maximum | 737147 |
| Range | 737145 |
| Interquartile range (IQR) | 1300 |
Descriptive statistics
| Standard deviation | 17942.88 |
|---|---|
| Coefficient of variation (CV) | 7.5403375 |
| Kurtosis | 1667.8704 |
| Mean | 2379.5858 |
| Median Absolute Deviation (MAD) | 650 |
| Skewness | 40.706572 |
| Sum | 4021500 |
| Variance | 3.2194695 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1800 | 41 | 1.1% |
| 3240 | 37 | 1.0% |
| 1900 | 34 | 0.9% |
| 1350 | 33 | 0.9% |
| 2700 | 33 | 0.9% |
| 900 | 28 | 0.8% |
| 1600 | 26 | 0.7% |
| 2000 | 24 | 0.7% |
| 1300 | 24 | 0.7% |
| 1700 | 23 | 0.6% |
| Other values (634) | 1387 | |
| (Missing) | 1988 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 50 | 3 | |
| 53 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 57 | 1 | < 0.1% |
| 60 | 5 |
| Value | Count | Frequency (%) |
| 737147 | 1 | < 0.1% |
| 13500 | 1 | < 0.1% |
| 11286 | 1 | < 0.1% |
| 9500 | 1 | < 0.1% |
| 9000 | 7 | |
| 8775 | 1 | < 0.1% |
| 8286 | 1 | < 0.1% |
| 8067.8 | 1 | < 0.1% |
| 8000 | 1 | < 0.1% |
| 7500 | 2 | 0.1% |
carpet_area
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 733 |
|---|---|
| Distinct (%) | 39.1% |
| Missing | 1805 |
| Missing (%) | 49.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2528.1194 |
| Minimum | 15 |
|---|---|
| Maximum | 607936 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 350 |
| Q1 | 837 |
| median | 1300 |
| Q3 | 1790 |
| 95-th percentile | 2950 |
| Maximum | 607936 |
| Range | 607921 |
| Interquartile range (IQR) | 953 |
Descriptive statistics
| Standard deviation | 22793.792 |
|---|---|
| Coefficient of variation (CV) | 9.0161059 |
| Kurtosis | 604.86092 |
| Mean | 2528.1194 |
| Median Absolute Deviation (MAD) | 475 |
| Skewness | 24.339718 |
| Sum | 4735167.6 |
| Variance | 5.1955696 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1400 | 42 | 1.1% |
| 1600 | 35 | 1.0% |
| 1800 | 35 | 1.0% |
| 1200 | 31 | 0.8% |
| 1500 | 29 | 0.8% |
| 1650 | 28 | 0.8% |
| 1350 | 27 | 0.7% |
| 1300 | 23 | 0.6% |
| 1450 | 22 | 0.6% |
| 1000 | 22 | 0.6% |
| Other values (723) | 1579 | |
| (Missing) | 1805 |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 72 | 1 | < 0.1% |
| 76.44 | 3 | |
| 77.31 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 607936 | 1 | |
| 569243 | 1 | |
| 514396 | 1 | |
| 64529 | 1 | |
| 64412 | 1 | |
| 58141 | 1 | |
| 54917 | 1 | |
| 48811 | 1 | |
| 45966 | 1 | |
| 34401 | 1 |
study room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2973 | |
| 1 | 705 | 19.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2973 | |
| 1 | 705 | 19.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2973 | |
| 1 | 705 | 19.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2973 | |
| 1 | 705 | 19.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2973 | |
| 1 | 705 | 19.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2973 | |
| 1 | 705 | 19.2% |
servant room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2350 | |
| 1 | 1328 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2350 | |
| 1 | 1328 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2350 | |
| 1 | 1328 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2350 | |
| 1 | 1328 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2350 | |
| 1 | 1328 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2350 | |
| 1 | 1328 |
store room
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3340 | |
| 1 | 338 | 9.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3340 | |
| 1 | 338 | 9.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3340 | |
| 1 | 338 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3340 | |
| 1 | 338 | 9.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3340 | |
| 1 | 338 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3340 | |
| 1 | 338 | 9.2% |
pooja room
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3022 | |
| 1 | 656 | 17.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3022 | |
| 1 | 656 | 17.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3022 | |
| 1 | 656 | 17.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3022 | |
| 1 | 656 | 17.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3022 | |
| 1 | 656 | 17.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3022 | |
| 1 | 656 | 17.8% |
others
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3273 | |
| 1 | 405 | 11.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3273 | |
| 1 | 405 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3273 | |
| 1 | 405 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3273 | |
| 1 | 405 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3273 | |
| 1 | 405 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3273 | |
| 1 | 405 | 11.0% |
furnishing_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.5 KiB |
| 1 | |
|---|---|
| 2 | |
| 0 | 209 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3678 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 2405 | |
| 2 | 1064 | |
| 0 | 209 | 5.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 2405 | |
| 2 | 1064 | |
| 0 | 209 | 5.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2405 | |
| 2 | 1064 | |
| 0 | 209 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3678 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2405 | |
| 2 | 1064 | |
| 0 | 209 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3678 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2405 | |
| 2 | 1064 | |
| 0 | 209 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2405 | |
| 2 | 1064 | |
| 0 | 209 | 5.7% |
luxury_score
Real number (ℝ)
ZEROS 
| Distinct | 161 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.503535 |
| Minimum | 0 |
|---|---|
| Maximum | 174 |
| Zeros | 462 |
| Zeros (%) | 12.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 31 |
| median | 59 |
| Q3 | 110 |
| 95-th percentile | 174 |
| Maximum | 174 |
| Range | 174 |
| Interquartile range (IQR) | 79 |
Descriptive statistics
| Standard deviation | 53.054919 |
|---|---|
| Coefficient of variation (CV) | 0.74199016 |
| Kurtosis | -0.8797416 |
| Mean | 71.503535 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 0.45948542 |
| Sum | 262990 |
| Variance | 2814.8244 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 462 | 12.6% |
| 49 | 348 | 9.5% |
| 174 | 195 | 5.3% |
| 44 | 60 | 1.6% |
| 165 | 55 | 1.5% |
| 38 | 55 | 1.5% |
| 72 | 52 | 1.4% |
| 60 | 47 | 1.3% |
| 37 | 46 | 1.3% |
| 42 | 45 | 1.2% |
| Other values (151) | 2313 |
| Value | Count | Frequency (%) |
| 0 | 462 | |
| 5 | 6 | 0.2% |
| 6 | 6 | 0.2% |
| 7 | 41 | 1.1% |
| 8 | 30 | 0.8% |
| 9 | 9 | 0.2% |
| 12 | 6 | 0.2% |
| 13 | 10 | 0.3% |
| 14 | 12 | 0.3% |
| 15 | 43 | 1.2% |
| Value | Count | Frequency (%) |
| 174 | 195 | |
| 169 | 1 | < 0.1% |
| 168 | 9 | 0.2% |
| 167 | 21 | 0.6% |
| 166 | 10 | 0.3% |
| 165 | 55 | 1.5% |
| 161 | 3 | 0.1% |
| 160 | 28 | 0.8% |
| 159 | 23 | 0.6% |
| 158 | 34 | 0.9% |
| property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | house | dlf city plots | sector 26 | 12.50 | 38580.0 | 3240.0 | Plot area 360(301.01 sq.m.) | 4 | 5 | 2 | 2.0 | North-East | Moderately Old | NaN | 3240.0 | NaN | 1 | 1 | 0 | 0 | 0 | 2 | 121 |
| 1 | flat | imperia the esfera | sector 37c | 0.99 | 5454.0 | 1815.0 | Super Built up area 1815(168.62 sq.m.)Built Up area: 1400 sq.ft. (130.06 sq.m.)Carpet area: 1270 sq.ft. (117.99 sq.m.) | 3 | 5 | 3+ | 5.0 | North-West | Relatively New | 1815.0 | 1400.0 | 1270.00 | 0 | 1 | 0 | 0 | 0 | 1 | 49 |
| 2 | flat | alpha corp gurgaonone | sector 84 | 1.49 | 7590.0 | 1963.0 | Super Built up area 1963(182.37 sq.m.)Carpet area: 1700 sq.ft. (157.94 sq.m.) | 3 | 3 | 3 | 6.0 | South-East | Relatively New | 1963.0 | NaN | 1700.00 | 0 | 1 | 0 | 0 | 0 | 1 | 28 |
| 3 | house | independent | sector 110 | 0.34 | 6800.0 | 500.0 | Built Up area: 500 (46.45 sq.m.) | 1 | 1 | 0 | 1.0 | NaN | Undefined | NaN | 500.0 | NaN | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 4 | flat | vatika sovereign next sector-82 a gurgaon | sector 82a | 2.91 | 6700.0 | 4343.0 | Super Built up area 4350(404.13 sq.m.)Carpet area: 3480 sq.ft. (323.3 sq.m.) | 4 | 4 | 3 | 6.0 | East | Under Construction | 4350.0 | NaN | 3480.00 | 1 | 1 | 1 | 0 | 0 | 2 | 79 |
| 5 | flat | indiabulls enigma | sector 110 | 4.00 | 9851.0 | 4061.0 | Built Up area: 3350 (311.23 sq.m.) | 4 | 4 | 3 | 17.0 | NaN | New Property | NaN | 3350.0 | NaN | 0 | 0 | 0 | 0 | 0 | 1 | 59 |
| 6 | house | independent | sector 31 | 3.50 | 24155.0 | 1449.0 | Plot area 161(134.62 sq.m.) | 4 | 4 | 3 | 2.0 | North-East | Old Property | NaN | 1449.0 | NaN | 0 | 0 | 1 | 0 | 0 | 2 | 70 |
| 7 | flat | signature global city 37d ph 2 | sector 37d | 1.10 | 8148.0 | 1350.0 | Carpet area: 1350 (125.42 sq.m.) | 3 | 3 | 1 | 2.0 | North | Under Construction | NaN | NaN | 1350.00 | 0 | 0 | 0 | 1 | 0 | 1 | 30 |
| 8 | flat | mapsko mount ville | sector 79 | 1.30 | 8024.0 | 1620.0 | Super Built up area 1620(150.5 sq.m.)Carpet area: 863.91 sq.ft. (80.26 sq.m.) | 3 | 3 | 3+ | 1.0 | North-East | Relatively New | 1620.0 | NaN | 863.91 | 0 | 0 | 0 | 0 | 0 | 2 | 140 |
| 9 | house | sushant lok 1 builder floors | sector 43 | 1.95 | 10077.0 | 1935.0 | Plot area 215(179.77 sq.m.) | 3 | 4 | 3+ | 4.0 | West | New Property | NaN | 1935.0 | NaN | 0 | 1 | 0 | 0 | 0 | 2 | 78 |
| property_type | society | sector | price | price_per_sqft | area | areaWithType | bedRoom | bathroom | balcony | floorNum | facing | agePossession | super_built_up_area | built_up_area | carpet_area | study room | servant room | store room | pooja room | others | furnishing_type | luxury_score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3793 | flat | central park resorts | sector 48 | 11.25 | 28266.0 | 3980.0 | Carpet area: 3980 (369.75 sq.m.) | 4 | 5 | 3+ | 2.0 | North-West | Relatively New | NaN | NaN | 3980.0 | 0 | 1 | 0 | 1 | 0 | 1 | 45 |
| 3794 | flat | suncity vatsal valley | gwal pahari | 1.00 | 13340.0 | 750.0 | Carpet area: 750 (69.68 sq.m.) | 2 | 2 | 2 | 3.0 | NaN | New Property | NaN | NaN | 750.0 | 0 | 0 | 0 | 0 | 0 | 2 | 13 |
| 3795 | house | nul | sector 25 | 9.50 | 33404.0 | 2844.0 | Plot area 316(264.22 sq.m.) | 9 | 9 | 3+ | 3.0 | North-West | Old Property | NaN | 2844.0 | NaN | 0 | 0 | 0 | 0 | 1 | 1 | 81 |
| 3796 | flat | espire south | sector 68 | 1.32 | 9050.0 | 1459.0 | Built Up area: 1460 (135.64 sq.m.) | 2 | 2 | 2 | 9.0 | NaN | New Property | NaN | 1460.0 | NaN | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 3797 | flat | trisara our homes 3 | sohna road | 0.36 | 5980.0 | 602.0 | Super Built up area 602(55.93 sq.m.)Carpet area: 450 sq.ft. (41.81 sq.m.) | 2 | 2 | 1 | 1.0 | NaN | New Property | 602.0 | NaN | 450.0 | 0 | 0 | 0 | 0 | 0 | 1 | 45 |
| 3798 | flat | corona optus | sector 37c | 1.25 | 7090.0 | 1763.0 | Super Built up area 1763(163.79 sq.m.)Built Up area: 1350 sq.ft. (125.42 sq.m.)Carpet area: 1234 sq.ft. (114.64 sq.m.) | 3 | 3 | 3 | 4.0 | North-West | Relatively New | 1763.0 | 1350.0 | 1234.0 | 0 | 0 | 0 | 0 | 1 | 1 | 49 |
| 3799 | flat | sare crescent parc | sector 92 | 0.50 | 5000.0 | 1000.0 | Super Built up area 1000(92.9 sq.m.) | 2 | 2 | 2 | 2.0 | NaN | Relatively New | 1000.0 | NaN | NaN | 0 | 0 | 0 | 0 | 1 | 1 | 72 |
| 3800 | house | raj villas | sector 52 | 8.00 | 25543.0 | 3132.0 | Carpet area: 348 (290.97 sq.m.) | 6 | 5 | 3+ | 4.0 | East | Undefined | NaN | NaN | 348.0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 3801 | flat | project krishna colony | sector 7 | 0.36 | 5142.0 | 700.0 | Built Up area: 700 (65.03 sq.m.) | 2 | 2 | 1 | 0.0 | NaN | Relatively New | NaN | 700.0 | NaN | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 3802 | house | independent | sector 46 | 6.50 | 27461.0 | 2367.0 | Plot area 263(219.9 sq.m.) | 12 | 12 | 3+ | 4.0 | North-West | Relatively New | NaN | 2367.0 | NaN | 1 | 0 | 0 | 1 | 0 | 2 | 32 |